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A first measurement of the cross section of single top quarli production in the t channel in pp collision at 
y/s = 7 TeV is presented. The measurement is performed on a data sample corresponding to an integrated 
luminosity of 35.9 pb~^ recorded at the LHC with the CMS detector. Leptonic decay channels with an electron 
or a muon in the final state are considered. After a selection optimized for the i-channel mode, two different 
and complementary analyses have been performed. Both analyses confirm the Tevatron's observation of single 
top, and their combination measures a cross section of cr = 83.6 it 29.8 (stat. + syst.) it 3.3 (lumi.) pb which is 
consistent with the Standard Model prediction. 



1. Introduction 

The existence of single top production has been estabhshed by the DO and CDF experiments at the Tevatron 
pp colhder [2-0] and the first measurements of individual production mechanisms have recently appeared [H [S] . 
Three different production mechanisms are foreseen in the Standard Model (SM) t channel, s channel, and tW 
(or VF-associated) . In 7 TeV proton-proton collisions the t-channel mode, is by far the most abundant of the 
three mechanisms and it is the one with the most striking final state topology. Next-to-leading order (NLO) 
computations predict at^4FS = 59.114'q pb in the 4-flavour scheme and at^5FS — 62.3l2'4 pb in the 5-flavour [B], 
for a top mass of mt — 172.5 GeV/c^. 

We present here first evidence for t-channel single top quark production in pp collisions at ^/s = 7 TeV at 
the LHC, with a first measurement of the production cross section [7]. The results are based on the data 
sample sample collected in 2010 by the Compact Muon Solenoid (CMS) experiment [5|, and corresponds to an 
integrated luminosity of 35.9 ± 1.4 pb""'^. 

This measurement is performed in the leptonic decay channel, in which the W boson decays into an electron 
or a muon. The t-channel production mode is treated as signal and the other two production modes will be 
considered as background. 

After a dedicated event selection, two complementary analyses are performed. In the first analysis, referred to 
as the 2D-analysis, a data driven method using two angular properties specific to t-channel top quark production 
will be used. In the second analysis, referred to as the BDT-analysis, the overall compatibility of the signal 
event candidates with the Standard Model expectations of electroweak top quark production is probed by using 
a multivariate analysis technique. 



2. The event selection 

Both analyses employ similar reconstruction techniques and selection criteria. Signal events are characterized 
by exactly one isolated muon or electron and missing transverse energy from the leptonic decay of the W boson 
as well as by one central b-}et from the top quark decay and an additional light-quark jet from the hard scattering 
process. The latter is found most often in the forward direction. 

Events are required to pass either a single electron trigger or a single muon trigger. The minimum E^j^ 
requirement for the electron trigger ranged from 10 GeV to 22 GeV, while the minimum p^ requirement for 
the muon trigger ranged from 9 GeV to 15 GeV. The selected data sample is used both for the selection of 
the signal and for signal-depleted control regions used for data-driven background studies. Therefore no lepton 
isolation criteria were used at trigger level, in order to allow background estimations based on samples failing 
these criteria. 

After offline reconstruction, events are selected requiring exactly one isolated lepton (electron or muon) with 
Pt > 20 GeV/c and \ri\ < 2.4 for muons and p^ > 30 GeV/c, \r]\ < 2.5 for electrons, and exactly two jets 
{pt > 30 GeV/c, |ry| < 5). In the 2D analysis, only the jets and the missing transverse energy (^t) are 
reconstructed with the particle flow algorithm 9J, while in the BDT-analysis, all objects are reconstructed with 
the particle flow algorithm. 

In order to reduce the large background from W+ light partons, one of the two selected jets is required to be 
identified as a b-jet according to the tight selection criteria of the tracit counting (TC) 6-tagging algorithm 110) . 
To further reduce the background, the 2D-analysis requires that the second jet is not tagged according to a looser 
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selection criteria, since most of the signal events are expected to have only one h quark inside the acceptance 
of the tracking detectors (|?7| < 2.5). The BDT-analysis does not make this requirement in order to increase 
statistics and profit from the larger separation power of the BDT discriminant. 

One the other hand, to remove the kinematic region where the two identified jets arc back-to-back, the 
BDT-analysis requires the two selected jets to satisfy A0(ji,j2) < 3.0. This region is found to be poorly 
reproduced by the simulation in a sample enriched in M^-|-light partons, affecting some of the observables used 
in the analysis. 

Finally, to further suppress contributions from processes where the lepton does not come from a leptonically 
decaying W boson, the transverse mass is required to be Mt > 40 GeV/c^ for muon events and Mt > 50 GeV/c^ 
for electron events. The transverse mass is defined as 



where the neutrino momentum vector is assumed equal to the missing transverse energy (^t)- 

In data, the number of selected events is 72 in the electron and 112 in the muon channels in the 2D-analysis, 
and 82 in the electron and 139 in the muon channels in the BDT-analysis. 

2.1. Top quark reconstruction 

Both analyses require the reconstruction of the 4-momcntum of the top-quark candidate. A constrained 
kinematic fit is used to reconstruct the complete kinematics of the event under the hypothesis that it is a single 
top event decaying into a lepton+jets final state. This leads to a quadratic equation in the longitudinal neutrino 
momentum, Pz,u- Solutions to this equation can have an imaginary part when Mt is larger than the W pole 
mass used in the constraint. The imaginary component is then eliminated by modifying the_^T such as to give 
Mt = MiY, still respecting the W mass constraint. When two real solutions are present, which happens in 
77.6% of the cases, the solution with the smallest \pz,i/\ is chosen, which, in simulated events, is correct in 60.3% 
of the cases. 

Choosing the jet with the highest 5-tagging discriminant as the jet originating in the decay of the top quark 
is correct in simulated events in 92.6% (87.4%) of the cases after the 2D (BDT) selection. The non-tagged jet 
is matched to the recoil quark in 89.6% (84.0%) of the cases. 

3. Background Estimation 

With a relatively small signal and a large background, one crucial element of the analysis is the determination 
of the background. Data-driven methods are thus used to estimate the main backgrounds. The event yields for 
the two selections are summarized in Table IXSl 



This analysis probes a very specific kinematical phase space populated only by the tails of the QCD dis- 
tributions. This, despite the excellent agreement of the CMS simulation with data, together with the small 
number of selected events in the simulation, makes the estimate of this background on simulated events not 
very significant. 

The normalization of this background is estimated by a profile likelihood fit to the Mt distribution after all 
other selection criteria have been applied, by parametrizing the Mt distribution as F{Mt) — a ■ S{Mt) + b ■ 
B{Mt), where S{Mt) and B{Mt) are templates for signal-like (leptons coming from W decays) and QCD-like 
events, respectively. The S{Mt) template is taken from the simulation, while the B{Mt) template is extracted 
from a high statistics background dominated data sample composed mainly of QCD events. This sample is 
obtained by removing the 6-tagging and Mt requirements and inverting the isolation cut. These requirements 
reject most of the signal-like events (single top, W + X, ti, and in general any process with a charged lepton 
from an intermediate W boson). 

This procedure yields the following predictions for the number of QCD events passing the Mt threshold in 
the 2D analysis: 




(1) 



3.1. QCD estimation 



N: 



N: 



t2D _ 
qcd 

r2D ^ 
qcd 



0.62 ± 0.12 (stat.) ± 0.08 (shape) ± 0.15 (stability) 
2.6 ± 0.6 (stat.) ± 3.1 (shape) ± 1.2 (stability) 



(muons) 
(electrons) 
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Process 


SF from region A 


SF from region B 


(1 channel 
e channel 


1.02 ±0.03 
0.97 ±0.04 


1.27 ±0.09 
1.05 ±0.11 



Table L Scale factors for VK±light partons predicted by the fits in control regions A and B in the 2D analysis. Uncertainties 
are statistical only. 



and in the BDT analysis: 

iVg^f^ = 4.92 ± 0.99 (stat.) ± 0.05 (shape) ± 0.81 (stability) (muons) 
Nf^^^ = 5.27 ± 1.24 (stat.) ± 0.79 (shape) ± 3.23 (stability) (electrons) 

where "shape" indicates the systematic uncertainty coming from the B{Mt) model and "stability" indicates 
the maximum variation between the results when varying the fit range. The central values of these predictions 
are used in the analyses, while the uncertainties on these values are conservatively taken as ±50% in the muon 
decay channel in both analyses, ±100% in the electron decay channel in the BDT analysis and ^^[qq^ in the 2D 
analysis. 



3.2. VT+light partons estimation 

The VF±light partons background is treated differently in both analyses. In the BDT analysis, the M^+light 
partons yield is treated as a nuisance parameter in a fully Bayesian procedure. In the 2D analysis, partially 
data-driven methods are used to extract the normalization and the kinematics. The same factor is then also 
used for Z+ jets. 

A suitable control sample, dominated by the W ± light flavors background, is obtained with an orthogonal 
selection where the events are required to have one isolated lepton and exactly two jets. One of the jets is required 
to be "taggable", i.e., within the tracker acceptance and with at least two tracks satisfying the quality selection 
of the 6-tagging algorithm. Both jets should fail the tight 6-tagging selection. To model the distributions of 
the variables used in this analysis in W ± light flavor background events in the signal region, the distributions 
obtained in this VF— enriched sample in data will be used, after subtracting the other contributions (including 
signal, which accounts for roughly 1% of this sample) estimated with simulated samples. 

To estimate the scale factor for the IF±light partons background components both this VF-enriched control 
sample (control sample A) and a subset where at least one jet passes the loose 6-tagging selection (but fails the 
tight one - control sample B) are used. In both samples a fit on the full Mt distribution is performed. The 
QCD and W^+light partons components are free parameters in the fit, while all other processes, including the 
heavy flavour components and the ^-channel signal, are constrained to the expected values. The scale factors 
between the number of events in the M^-enriched control sample in data and simulation found are given in 
Table D 

The 2D analysis takes as central predictions those from control sample B, upon the argument that it is closer 
to the signal region, obtaining an expectation of 18.2 (116) Ty±light parton events in the signal region for the 
muon (electron) decay channel. An uncertainty of ±30% (±20%) is then used, which covers both the statistical 
uncertainty from the fit and the difference between both predictions. The same scale factors are applied to 
2'±jets. 



3.3. Other bacl^ground contributions 

The VQQ and Wc components are scaled to LO values and, on top of this correction, by further factors 2 ± 1 
and lio 5, respectively, in order to take into account the results of the tt cross section measurement exploiting 
^-tagging [TT], from which the ti cross section itself is taken. The theory prediction is used for VV ;12^ and 
single top in s [13] and tW [14] channels. The uncertainties on these values are considered as components of 
the systematic uncertainty. The BDT analysis treats the normalization of these backgrounds as a nuisance 
parameter in the fit, with Gaussian constraints corresponding to the systematic uncertainty. 
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Table II: Event yields summary, including data-driven estimations and fo-tagging scale factors. The signal (f ) is normalized 
to the 5-flavour computation with the corresponding uncertainty [Q. 



Process 


2D, ^ channel 


2D, e channel 


BDT, /I channel 


BDT, e channel 


single top, t channel (f) 


i / .0 ± O.i 


± U.4 


i ( .D ± 0. / 


iU. / ± 0.0 


single top, s channel 


0.9 ± 0.3 


0.6 ± 0.2 


1.4 ± 0.5 


1.0 ± 0.3 


single top, tW 


3.1 ± 0.9 


2.4 ± 0.7 


3.8 ± 1.1 


< 0.1 


WW 


0.29 ± 0.09 


0.23 ± 0.07 


0.32 zb 0.10 


0.23 zb 0.07 


VV Zj 




n 1 7 + n fTi 


U.OO ZC U.IU 


1 c _|_ f) zL 

L.O ZIZ U.^: 


Zj Zi 


n m o_i_ n hak 


U.Oii ± O.OUo 


A AOA _l_ A AA/^ 

U.UzU ± U.UUo 


< U.i 


W light partons 


lo.2 dz 0.0 


lie _l_ o o 
ii.D ± 1.6 




T A _l_ Q K 


Zj -\- A. 


1 7 1 n tr 
1. ( =h U.O 


i.D ± U.O 


fi 7 1 AO 

U. / dz U.z 


A AC _|_ A HQ 

U.UO dz U.Uo 




n _|_ n o 
u.u ZC U.O 




4 A _|_ 9 c 

4:.y ziz z . o 


c; q _|_ c q 

O.O ZIZ U.O 


VQQ 


20.4 ± 10.2 


14.1 ± 7.1 


17.6 ± 8.8 


11.7 ± 5.8 


Wc 


12 9 +^^-^ 

^^■^ -6.5 


9.4 


-4.6 


0.» _2.9 


tt 


20.3 ± 3.6 


15.6 ± 2.8 


34.9 ± 4.9 


22.9 ± 3.2 


Total background 


78.6 ± 15.2 


58.4 ± 11.0 


82.4 ± 13.1 


55.9 ± 10.2 


Signal + background 


96.2 ± 15.3 


69.6 ± 11.0 


100.0 ± 13.2 


66.6 ± 10.2 


Data 


112 


72 


139 


82 



4. The analyses 
4.1. The 2D Analysis 

The cross section is determined by performing an unbinned likelihood fit to the 2D distribution of two 
variables, cos6'j*- and rjij. The distributions of these two variables are shown in Fig.jlj 

A property specific to single top production is the almost 100% left-hand polarization of the top quark due 
to the V — A structure of the electroweak interaction [TJl [TS] . Because the lifetime of the top quark is shorter 
than the hadronization scale, the direction of the top-quark spin is visible in angular correlations of its decay 
products. These are distributed according to 



(2) 



where 0;*- is the angle between the direction of the outgoing lepton and the spin axis, approximated by the 
direction of the untagged jet, in the top-quark rest frame. A is the coefficient of spin asymmetry, equal to -1-1 
for charged leptons. 

Another important feature of the signal is the presence of a recoil jet, from the fragmentation of a light 
(untagged) quark, with a characteristic rj distribution. 

The inputs to the fit are the template distributions for signal and backgrounds, with separate templates for 
each lepton. For the backgrounds, a w 2% correlation is neglected and the 2D distribution is taken as the 
product of the ID templates. The shapes of the discriminating variables for the QCD and VF-|-light partons 
components are taken from the control samples, while all other shapes are taken from simulation. The overall 
background floats unconstrained in the fit, while its relative components are fixed according to the background 
estimates. 



4.2. The BDT Analysis 

This analysis assesses the compatibility of the data with the Standard Model predictions of electroweak top 
quark production using a multivariate analysis method. Boosted Decision Trees (BDT) are used, with 1000 
decision trees and the ADA boosting algorithm as implemented in the TMVA package |17j . 

A total of 37 observables reconstructed in the detector have been chosen from five categories. The validity of 
the description of the input variables in the simulation has been checked using a Kolmogorov-Smirnov test in the 
orthogonal W^-enriched control sample. The first type of observables covers the kinematics and properties of the 
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Figure 1: Cosine of the angle between charged lepton and untagged jet in the reconstructed top rest frame (cosSy), left, 
and pseudorapidity of the untagged jet {rjij), right, after the full event selection, combining the muon and electron decay 
channels. The dip at cos 0;* 1 is due to the lepton pr and Mt selection cuts. 



c - CMS, 36 pb"\\/s = 7 TeV 

0) - 




-0.8 -0.6 -0.4 -0.2 
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Figure 2: Boosted decision tree discriminant [hdt) after the dedicated BDT selection, combining the muon and electron 
decay channels. Predicted backgrounds are scaled to the medians of their posteriors from the fit. 



leptons and the jets (this includes rnj), while the second type refers to correlations between these objects. A third 
type results from properties of their combinations, the W-boson, the top quark, and the sum of the hadronic 
four- momenta. A fourth type of observables, which includes cos 9ij, exploits the angular distributions between 
original (lepton, jet) and derived objects {W, top quark, etc.). A fifth type are the event related observables, 
such as the sphericity and the total and transverse energies contained in the parton collision process. In all 
these observables, the description of the measured distributions by the simulated data is found to be reasonable 
within the theoretical uncertainties. The most important observables are the lepton momentum, s defined as 
the mass of the system formed by the reconstructed W boson and the two jets, the of the system formed 
by the two jets, the pt of the most 6-tagged jet, and the reconstructed top mass. The bdt classifier has been 
validated both in simulation and in data. It is shown in Fig. [2] 

The cross section is then extracted from a binned likelihood fit to the bdt distribution with a Bayesian 
approach, where the normalizations of all backgrounds except the multi-jet QCD background and the other 
systematic uncertainties are treated as nuisance parameters in the fit. For the multi-jet QCD background, the 
data-driven estimate is used. 
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Analysis, channel 


expected 


observed 


2D, /x-channel 




2.5 


2D, e-channel 




3.1 


2D, combined 


2 1+1° 
^■^-1.1 


3.7 


BDT, /i-channel 


2 4+0.9 


3.1 


BDT, e-channel 


2.0±1.0 


1.9 


BDT, combined 


^•^-0.9 


3.5 



Table III: Expected and observed significances, in number of Gaussian standard deviations, estimated from pseudo- 
experiments. The uncertainty on the expected significances represents the central 68% quantile. 



5. Measurement of the production cross section 

The 2D analysis yields the following cross section measurements: 

CT^^ = 104.1 ± 42.3 (stat.) t^^;^ (syst.) ± 4.2 (lumi.) pb muon channel (3) 
CT^^ = 154.2 ± 56.0 (stat.) tf^l (syst.) ± 6.2 (lumi.) pb electron channel (4) 
CT^^ = 124.2 ± 33.8 (stat.) t^^l (syst.) ± 5.0 (lumi.) pb combined (5) 

When combining the electron and the muon decay channels, all systematic uncertainties are considered fully 
correlated with exception of the data-driven uncertainty on multi-jet QCD. 
In the BDT analysis, the following cross sections are measured: 

^BDT ^ gQ 4 _^ 35 {siai.) (syst.) ± 3.6 (lumi.) pb muon channel (6) 
^BDT ^ 2 ± 35 J (stat.) +}3-i (syst.) ± 2.4 (lumi.) pb electron channel (7) 
^BDT ^ 73 7 _^ 25.4 (stat.) tf^l (syst.) ± 3.1 (lumi.) pb combined (8) 

The main systematic uncertainties are the uncertainty on b-tagging, the jet energy scale and the modeling of 
the signal and backgrounds. The expected and observed significance when including all systematic uncertainties 



are given in Table III The measurements are consistent among them and with the standard model expectation 
in the 4- and 5-flavour schemes. Both confirm the Tevatron observation of the electroweak mode of top quark 
production. 

The measurements from the 2D and BDT analyses are then combined with the Best Linear Unbiased Esti- 
mation (BLUE) method [18]. The statistical correlations estimated from simulated samples is 0.51. Systematic 
uncertainties common to both methods are assumed to be 100% correlated. The combined result is: 

cr = 83.6 ± 29.8 (stat. -f- syst.) ± 3.3 (lumi.) pb 

This result can be used to derive an estimate of CKM matrix element \Vtb\- With the assumption that \Vtd\ 
and |Vts| are much smaller than |Vtfc| and using the NLO prediction in the 5-flavors scheme a*'* = 62.3^2^ pb [6], 
\Vtb\ is found to be 



\Vtb\ = \/^ - 1.16 ±0.22 (exp.)±0.02 (th.) 



6. Conclusion 

A first measurement of the production cross section of single top quark pp collisions at -^/s = 7 TeV was 
performed on an integrated luminosity of 36 pb~^ recorded at CMS. Two separate analyses were made, and the 
combination of the two measurements yields a = 83.6 ± 29.8 (stat. -I- syst.) ± 3.3 (lumi.) pb. This measurement 
is consistent with the SM prediction. 
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